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METHOD FOR EXPRESSION OF SMALL ANTIVIRAL RNA MOLECULES 

WITHIN A CELL 

Government Support 

[0001] This invention was made with government support under Grant Number 
GM39458 awarded by the National Institutes of Health. The United States Government has 
certain rights in the invention. 

Background of the Invention 

Field of the Invention 

[0002] The present invention relates generally to methods for altering gene 
expression in a cell or animal using viral constructs engineered to deliver an RNA 
molecule, and more specifically to deliver double-stranded RNA molecules that can be used 
to down-regulate or modulate gene expression. Particular aspects of the invention relate to 
down-regulating a pathogenic virus gene or a gene necessary for a pathogenic virus life 
cycle through delivery of a viral construct engineered to express an RNA molecule. 

Description of the Related Art 

[0003] RNA interference (RNAi) or silencing is a recently discovered 
phenomenon (A. Fire et al. 9 Nature 391, 806 (1998); C.E. Rocheleau et al Cell 90, 707 
(1997)). Small interfering RNAs ("siRNAs") are double-stranded RNA molecules that 
inhibit the expression of a gene with which they share homology. siRNAs have been used 
as a tool to down regulate the expression of specific genes in a variety of cultured cells as 
well as in invertebrate animals. A number of such approaches have been reviewed recently 
(P.D. Zamore Science 296, 1265 (2002)); however, such approaches have limitations. For 
example, no technique prior to the invention described herein allows for the generation of 
transgenic mammals having a specific gene down regulated through RNA interference. 
Similarly, there is a need for more robust methods for the introduction of small RNA 
molecules with regulatory function. The invention provided herein addresses these and 
other limitations in the field of RNA mediated gene regulation. Likewise, there is a need 
for improved methods and compositions for the treatment of viruses and diseases associated 
with viral infection. 
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Summary of the Invention 
[0004] The invention relates generally to methods to express within a cell an 
RNA molecule or molecules. These methods can be used with a wide variety of cell types. 
RNA molecules can be expressed within a cell for a variety of purposes. For example, 
RNA molecules can serve as markers within a cell, can be antisense oligonucleotides or 
ribozymes for regulating gene expression, and can serve to down regulate genes through 
RNA interference. 

[0005] In one aspect, the methods of the invention relate to the treatment or 
prevention of infection through the expression of one or more RNA molecules that inhibit 
one or more aspects of the life cycle of a pathogen through RNA interference with a target 
nucleic acid, such as a viral genome, a viral transcript or a host cell gene that is necessary 
for viral replication. 

[0006] According to another aspect of the invention, a method of expressing an 
RNA molecule is provided which includes transfecting a packaging cell line with a 
retroviral construct and recovering recombinant retrovirus from the packaging cell line. A 
host cell is then infected with the recombinant retrovirus. 

[0007] The recombinant retrovirus construct preferably has a first RNA 
polymerase HI promoter region, at least one RNA coding region, and at least one 
termination sequence. The RNA coding region preferably comprises a sequence that is at 
least about 90% identical to a target sequence within the target nucleic acid. Preferably the 
target nucleic is necessary for the life cycle of a pathogen, for example, part of a pathogenic 
virus RNA genome or genome transcript, or part of a target cell gene involved in the life 
cycle of a pathogenic virus. 

[0008] In one embodiment, the methods of the invention are used to disrupt the 
life cycle of a pathogen. In a particular embodiment the methods are used to disrupt the life 
cycle of a virus having an RNA genome, for example a retrovirus, by targeting the RNA 
genome directly. In another embodiment a viral genome transcript is targeted, including 
transcripts of individual viral genes. The methods also can be used to down regulate a gene 
in a host cell, where the gene is involved in the viral life cycle, for example, a receptor or 
co-receptor necessary for viral entry into the host cell. 
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[00091 In one aspect of the invention, the RNA coding region encodes an 
siRNA, preferably a self-complementary "hairpin*' RNA molecule having a sense region, an 
antisense region and a loop region. The loop region is generally between about 2 and about 
15 nucleotides in length, and in a more preferred embodiment is about 6 to about 9 
nucleotides in length. The double-stranded region of the hairpin molecule comprises a 
nucleotide sequence that is homologous to the target sequence. The sequence in the hairpin 
molecule is preferably at least about 90% identical to a target sequence, more preferably at 
least about 95% identical, even more preferably at least about 99% identical. 

[0010] In another embodiment, the RNA coding region encodes a first RNA 
molecule, and the retroviral construct has a second RNA polymerase IH promoter and a 
second RNA coding region operably linked to the second RNA polymerase III promoter. In 
such an embodiment, the second RNA coding region encodes an RNA molecule 
substantially complementary to the first RNA molecule. Upon expression of the first and 
second RNA coding regions, a double-stranded complex is formed within a cell. 

[0011] In yet another embodiment, the retroviral construct can have a second 
RNA polymerase HI promoter region operably linked to the RNA coding region, such that 
expression of the RNA coding region from the first RNA polymerase HI promoter results in 
the synthesis of a first RNA molecule and expression of the RNA coding region from the 
second RNA polymerase HI promoter results in synthesis of a second RNA molecule 
substantially complementary to the first RNA molecule. In one such embodiment, the RNA 
polymerase m promoters are separated from the RNA coding region by termination 
sequences. 

[0012] In one embodiment of the invention, the target cell is an embryonic cell. 
An embryonic cell as used herein includes a single cell embryo, and embryo cells within an 
early-stage embryo. In another embodiment of the invention, the target cell is an 
embryogenic stem cell. When the target cell is an embryonic cell, the embryonic cell can 
be infected by injecting the recombinant retrovirus between the zona pellucida and the cell 
membrane of a mammalian embryonic cell. In another embodiment, the embryonic cell can 
be infected by removing the zona pellucida and incubating the cell in solution containing 
the recombinant retrovirus. In such an embodiment, the zona pellucida can be removed by 
enzymatic digestion. When the target cell is an embryonic cell or an embryogenic stem 
cell, the methods of the invention also include implanting the embryonic cell in a 
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pseudopregnant female to generate a transgenic animal. In such a fashion, a transgenic 
animal can be generated that is resistant to a particular pathogen, such as a virus. 

[0013] The methods of the invention can also be used with a variety of primary, 
ex vivo normal or diseased cells or cells adapted in various tissue culture conditions. The 
cells are preferably obtained from human, mouse or other vertebrates. The cells may 
include, without limitation, hematopoietic stem or precursor cells, central nerve system 
cells, cells with regenerative capacities for a variety of other tissues and organs, dendritic 
cells and other developing and mature myeloid and lymphoid cells, and cancer cells derived 
from different cell lineages. 

[0014] In another aspect the invention provides retroviral constructs for the 
expression of an RNA molecule or molecules within a cell. The constructs preferably 
comprise an RNA polymerase IK (pol HI) promoter. In one embodiment the retroviral 
constructs have an RNA coding region operably linked to the RNA polymerase DI 
promoter. The RNA coding region can be immediately followed by a pol in terminator 
sequence, which directs termination of RNA synthesis by pol HI. The pol III terminator 
sequences generally have 4 or more consecutive thymidine ("T") residues. In a preferred 
embodiment, a cluster of 5 consecutive Ts is used as the terminator by which pol HI 
transcription is stopped at the second or third T of the DNA template, and thus only 2 to 3 
uridine ("U") residues are added to the 3' end of the coding sequence. A variety of pol HI 
promoters can be used with the invention, including for example, the promoter fragments 
derived from HI RNA genes or U6 snRNA genes of human or mouse origin or from any 
other species. In addition, pol HI promoters can be modified/engineered to incorporate 
other desirable properties such as the ability to be induced by small chemical molecules, 
either ubiquitously or in a tissue-specific manner. For example, in one embodiment the 
promoter may be activated by tetracycline. In another embodiment the promoter may be 
activated by IPTG (lad system). 

[0015] The retroviral construct can be based on a number of retroviral vectors. 
In a preferred embodiment, the retroviral construct has the R and U5 sequences from a 5' 
lenti viral long terminal repeat (LTR) and a self-inactivating lentiviral 3' LTR. In another 
embodiment, the retroviral vector is derived from the murine stem cell virus (MSCV). In 
yet another embodiment, the retroviral construct is a hybrid of a lentiviral and a MSCV 
construct. 
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[0016] In a further embodiment, the RNA coding region encodes a self- 
complementary RNA molecule having a sense region, an antisense region and a loop 
region. Such an RNA molecule, when expressed, preferably forms a "hairpin" structure. A 
loop region is generally between about 2 to 15 nucleotides in length. In a preferred 
embodiment, the loop region is from 6 to 9 nucleotides in length. In one such embodiment 
of the invention, the sense region and the antisense region are between about 15 and about 
30 nucleotides in length. In one embodiment, the RNA coding region of this embodiment 
of invention is operably linked downstream to an RNA polymerase HI promoter in such that 
the RNA coding sequence can be precisely expressed without any extra non-coding 
nucleotides present at 5' end (ie., the expressed sequence is identical to the target sequence 
at the 5* end). The synthesis of the RNA coding region is ended at the terminator site. In 
one preferred embodiment the terminator has five consecutive T residues. 

[0017] In another aspect of the invention, the retroviral vector can contain 
multiple RNA coding regions. In one such embodiment, the RNA coding region encodes a 
first RNA molecule, and the retroviral construct has a second RNA polymerase III promoter 
and a second RNA coding region operably linked to the second RNA polymerase EI 
promoter. In this embodiment, the second RNA molecule can be substantially 
complementary to the first RNA molecule, such that the first and the second RNA 
molecules can form a double-stranded structure when expressed. The double stranded 
region of the RNA complex is at least about 90% identical to a target region of either a viral 
genome, a viral genome transcript or a target cell RNA encoding a protein necessary for the 
pathogenic virus life cycle. The methods of invention also include multiple RNA coding 
regions that encode hairpin-like self-complementary RNA molecules or other non-hairpin 
molecules. 

[0018] In yet another embodiment of the invention, the retroviral construct has a 
second RNA polymerase HI promoter operably linked to the same RNA coding region in 
the opposite direction, such that expression of the RNA coding region from the first RNA 
polymerase HI promoter results in a synthesis of a first RNA molecule as the sense strand 
and expression of the RNA coding region from the second RNA polymerase III promoter 
results in synthesis of a second RNA molecule as antisense strand with substantial 
complementarity to the first RNA molecule. In such an embodiment, both RNA molecules 
can contain a 3* overhang of residues encoded by the termination sequence. In one 
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embodiment, both RNA polymerase III promoters are separated from the RNA coding 
region by termination sequences. Preferably the termination sequences comprise five 
consecutive T residues. 

[0019] According to another aspect of the invention^ the 5' LTR sequences can 
be derived from HIV. The retroviral construct can also have a woodchuck hepatitis virus 
enhancer element sequence and/or a tRNA amber suppressor sequence. 

[0020] In one embodiment of the invention, the self-inactivating 3* LTR can be 
a U3 element with a deletion of its enhancer sequence. In yet another embodiment, the self- 
inactivating 3' LTR is a modified HIV 3' LTR. 

[0021] The recombinant retroviral construct can be pseudotyped, for example 
with the vesicular stomatitits virus envelope glycoprotein. 

[0022] According to another aspect of the invention, the viral construct also can 
encode a gene of interest. The gene of interest can be linked to a Polymerase II promoter. 
A variety of Polymerase II promoters can be used with the invention, including for 
example, the CMV promoter. The RNA Polymerase II promoter that is chosen can be a 
ubiquitous promoter, capable of driving expression in most tissues, for example, the human 
Ubiquitin-C promoter, CMV 0-actin promoter and PGK promoter. The RNA Polymerase II 
promoter also can be a tissue-specific promoter. Such a construct also can contain, for 
example, an enhancer sequence operably linked with the Polymerase II promoter. 

[0023] In one embodiment, the gene of interest is a marker or reporter gene that 
can be used to verify that the vector was successfully transfected or transduced and its 
sequences expressed. In one such embodiment, the gene of interest is a fluorescent reporter 
gene, for example, the Green Fluorescent Protein. In yet another embodiment, the gene of 
interest is a drug resistant gene which can be used to select the cells that are successfully 
transduced. For example, the drug resistant gene can be the zeocin resistant gene (zeo). 
The gene of interest also can be a hybrid of a drug resistant gene and a fluorescent reporter 
gene, such as a zeo/gfp fusion. In another aspect of the invention, the gene of interest 
encodes a protein factor that can regulate the transcription activity of inducible pol EI 
promoters. In one of such embodiment, the gene of interest is tetR (repressor for tet 
operon) which regulates tetracycline responsive pol HI promoters. 

[0024] It is another aspect of the invention to provide methods for expressing an 
RNA molecule or molecules within a cell. In one embodiment a packaging cell line is 
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transfected with a retroviral construct of the invention, recombinant retroviral particles are 
recovered from the packaging cell line; and a target cell is infected with the recombinant 
retrovirus particles. According to such methods, the retroviral construct has the R and U5 
sequences from a 5' lentiviral long terminal repeat (LTR), a self-inactivating lentiviral 3' 
LTR, a first RNA polymerase m promoter region and at least one RNA coding region. The 
retroviral construct also can have a termination sequence operably linked to the RNA 
coding region. 

[0025] In a further aspect a method of treating a patient suffering from HIV 
infection is provided. In one embodiment, a CD34-positive target cell is isolated from the 
patient. The target cell is then infected with a recombinant retrovirus recovered from a 
packaging cell line transfected with a retroviral construct of the invention. Preferably, the 
recombinant retroviral construct comprises a first RNA polymerase DI promoter region, at 
least one RNA coding region, and at least one termination sequence. In one embodiment 
the RNA coding region comprises a sequence that is at least about 90% identical to a target 
region of the HIV genome, an HIV genome transcript or a cellular gene that is involved in 
the HIV life cycle. The target region is preferably from about 18 to about 23 nucleotides in 
length. 

[0026] In one embodiment the RNA coding region encodes a hairpin RNA 
molecule. 

[0027] In a preferred embodiment, the RNA coding region is at least about 90% 
identical to a target region of the CCR5 gene or the CXCR4 gene. 

Brief Description of the Drawings 
[0028] Figure 1A shows a schematic diagram of a retroviral vector carrying an 
' expression cassette for RNA expression, termed "RNA cassette" and a "Marker Gene" or 
gene of interest. The RNA expression cassette can be embedded at any permissible sites of 
the retroviral construct either as single copy or multiple tandem copies. In addition, 
although not indicated in the figure, more than one RNA expression cassette may be present 
in the retroviral construct. Figure IB shows a similar construct in which the RNA 
expression cassettes flank a marker gene. 
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[0029] Figure 2 shows a schematic view of an RNA expression cassette having 
a RNA polymerase m promoter 100 linked to an siRNA region 110-130, having a sense 
region 110, a loop region 120, and an antisense region 130, and a terminator sequence 140. 

[0030] Figure 3 shows a schematic view of an RNA expression cassette having 
a RNA polymerase HI promoter 100 linked to a first RNA coding region 110 and a first 
terminator sequence 140 and a second RNA polymerase III promoter 105 linked to a second 
RNA coding region 115 and a second terminator 145. 

[0031] Figure 4 shows a schematic view of an RNA expression cassette having 
a first RNA polymerase EI promoter 100 linked to an RNA coding region 1 10 and a first 
terminator sequence 145. The expression cassette has a second RNA polymerase HI 
promoter 105 linked to the RNA coding region 115, the same sequence as 110 in reverse, 
and a second terminator 140. 

[0032] Figure 5. Schematic illustration of a lacZ siRNA encoding lentiviral 
vector. 5 'LTR: an HIV based lentiviral vector 5' LTR; F: an HIV Flap element; pol HI: a 
human HI -RNA pol HI promoter (-240 to -8); siRNA: a lacZ specific small hairpin RNA 
coding region and its structure and detailed sequence are illustrated below. UbiC: an 
internal human ubiquitinC promoter; GFP: a GFP marker gene driven by UbiC promoter. 
W: a woodchuck RNA regulatory element. 3 'LTR: an HIV based self inactivating 
lentiviral 3' LTR. 

[0033] Figure 6. A lacZ specific siRNA encoded by a lentiviral vector can 
efficiently inhibit the expression of lacZ reporter gene in virus transduced mammalian cells. 
MEF: mouse embryonic fibroblasts; HEK293: human embryonic kidney cells. Both of the 
test cell lines harbor lacZ and firefly luciferase reporter genes, and the expression levels of 
the reporter genes can be measured by chemiluminescent assays. Ctrl: the ratio of lacZ 
activity versus Luc activity of the uninfected parental cells, which was arbitrarily set to 1. 
Transduced: the specific inhibition of lacZ expression calculated as the reduction of lacZ to 
Luc ratio. 

[0034] Figure 7. Transgenic animals that express a lacZ specific siRNA 
molecule encoded by a lentiviral vector can successfully suppress the expression of the 
ubiquitous lacZ reporter gene in a ROSA26+/- background. ROSA 1-6: the lacZ activities 
in the limb tissues of six El 7.5 ROSA26+/- embryos which served as positive controls. 
The difference in lacZ activity between individual F.OSA26+/- embryos may result from 
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variable protein extraction efficiency. TG1-4: the lacZ activities in the limb tissues of four 
El 7.5 transgenic embryos expressing a lentiviral vector-encoded lacZ siRNA molecule in 
ROSA+/- background. WT1-6: lacZ activity in the limb tissues of six El 7.5 C57B1/6 
wildtype embryos, included as the negative control. The background levels of endogenous 
beta-galactosidase activity are general below 1,000 LU/ug, thus the columns are not visible. 

[0035J Figure 8 shows a schematic illustration of a Tet-inducible lacZ siRNA 
lentiviral vector. A Tet repressor gene (TetR; SEQ ID NO: 7) is the under the control 
human UbiquitinC promoter and its expression can be monitored by the downstream GFP 
marker coupled by IRES element (internal ribosomal entry site). The anti-lacZ siRNA 
cassette is driven by a Tet-inducible pol HI promoter derived from human U6-promoter (- 
328 to +1) containing a single TetR binding site (TetOl) between the PSE and TATA box 
(SEQ ID NO: 6). In the absence of tetracycline, TetR binds to the promoter and its 
expression is repressed. Upon the addition of tetracycline, TetR is moved from the 
promoter and transcription starts. 

[0036] Figure 9 shows the results of an experiment that demonstrated that a Tet- 
inducible siRNA expression cassette can regulate gene expression in response to 
Doxycycline treatment. lacZ and luciferase double expressing HEK293 cells (293Z+Luc) 
were transduced with a lentiviral vector carrying a Tet-inducible lacZ-siRNA cassette and a 
Tet repressor under the control of a UbiquitinC promoter (Figure 8). The transduced cells 
were treated with 10 ug/ml Doxycycline (Plus Dox) for 48hr or without the Doxycycline 
treatment as a control (No Dox). LacZ and luciferase activities were measured as described 
in the previous figures. The relative suppression activity is calculated as the ratio of lacZ 
versus luciferase and No Dox control was arbitrarily set to 1 . 

[0037] Figure 10 shows a schematic illustration of an anti-human CCR5 siRNA 
encoding lentiviral vector. 5'LTR: an HIV based lentiviral vector 5' LTR; F: an HIV Flap 
element; a human U6-RNA pol m promoter (-328 to +1); siRNA: a human CCR5 specific 
short hairpin cassette and its structure and detailed sequence are illustrated below. UbiC: an 
internal human UbiquitinC promoter; GFP: a GFP marker gene driven by UbiC promoter. 
W: a woodchuck RNA regulatory element. 3'LTR: an HIV based self-inactivating 
lentiviral 3* LTR. 

[0038] Figure 11. A anti-human CCR5 specific siRNA encoded by a lentiviral 
vector can efficiently suppress the expression of CCR5 in transduced human cells. Cell 
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surface expression of CCR5 on transduced or untransduced MAGI-CCR5 (Deng, et aL, 
Nature, 381, 661 (1996)) was measured by flow cytometric analysis (FACS) and the 
relative expression levels were calculated by mean fluorescence intensity. A non-specific 
siRNA was also included as a control. 

[0039] Figure 12. Schematic illustration of an anti-HTV-1 siRNA encoding 
lenti viral vector. 5 'LTR: an HIV based lentiviral vector 5' LTR; F: an HIV Flap element; a 
human Hl-RNA pol m promoter (-240 to-9); siRNA: a HIV-1 Rev gene specific short 
hairpin cassette and its structure and detailed sequence are illustrated below. UbiC: an 
internal human ubiquitinC promoter; GFP: a GFP marker gene driven by UbiC promoter. 
W: a woodchuck RNA regulatory element. 3'LTR: an HIV based self inactivating 
lentiviral 3' LTR. 

[0040] Figure 13 demonstrates that an anti-HIV-1 Rev gene specific siRNA 
encoded by a lentiviral vector can efficiently suppress the expression of HIV transcription 
in human cells. The transcription activity of HIV-1 virus is measured a firefly luciferase 
reporter gene inserted at the env/nef region (Li, et al J Virol., 65, 3973 (1991)). The 
luciferase activity of the untransduced parental cells was arbitrarily set to 1 and the relative 
HIV transcription levels of the transduced cells were calculated accordingly. A non- 
specific siRNA containing vector was included as a control. 

[0041] Figure 14 shows a schematic diagram of a bivalent retroviral vector 
carrying both anti-HIV Rev and anti-human CCR siRNA expression cassettes. Symbols 
are the same as depicted in the previous figures. 

Detailed Description of the Preferred Embodiment 
[0042] The inventors have identified a method for introducing a transgene of 
interest into a cell or animal. This technique is described in copending U.S. provisional 
patent application 60/322,031 filed on 9/13/2001 and copending U.S. provisional patent 
application 60/347,782 filed on 1/9/2002, the entire contents of which are incorporated 
herein by reference. 

[0043] Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Any methods, devices and materials similar or equivalent to those 
described herein can be used in the practice of this invention. 
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[0044] By "transgene" is meant any nucleotide sequence, particularly a DNA 
sequence, that is integrated into one or more chromosomes of a host cell by human 
intervention, such as by the methods of the present invention. In one embodiment, a 
transgene is an "RNA coding region." In another embodiment the transgene comprises a 
"gene of interest." In other embodiments the transgene can be a nucleotide sequence, 
preferably a DNA sequence, that is used to mark the chromosome where it has integrated. 
In this situation, the transgene does not have to comprise a gene that encodes a protein that 
can be expressed. 

[0045] A "gene of interest" is a nucleic acid sequence that encodes a protein or 
other molecule that is desirable for integration in a host cell. In one embodiment, the gene 
of interest encodes a protein or other molecule the expression of which is desired in the host 
cell. In this embodiment, the gene of interest is generally operatively linked to other 
sequences that are useful for obtaining the desired expression of the gene of interest, such 
as transcriptional regulatory sequences. 

[0046] A "functional relationship" and "operably linked" mean, without 
limitation, that the gene is in the correct location and orientation with respect to the 
promoter and/or enhancer that expression of the gene will be affected when the promoter 
and/or enhancer is contacted with the appropriate molecules. 

[0047] An "RNA coding region" is a nucleic acid that can serve as a template 
for the synthesis of an RNA molecule, such as an siRNA. Preferably, the RNA coding 
region is a DNA sequence. 

[0048] A "small interfering RNA" or "siRNA" is a double-stranded RNA 
molecule that is capable of inhibiting the expression of a gene with which it shares 
homology. The region of the gene or other nucleotide sequence over which there is 
homology is known as the "target region." In one embodiment the siRNA may be a 
"hairpin" or stem-loop RNA molecule, comprising a sense region, a loop region and an 
antisense region complementary to the sense region. In other embodiments the siRNA 
comprises two distinct RNA molecules that are non-covalently associated to form a duplex. 

[0049] The term "animal" is used in its broadest sense and refers to all animals 
including mammals, birds, fish, reptiles and amphibians. 

[0050] The term "mammal" refers to all members of the class Mammalia and 
includes any animal classified as a mammal, including humans, domestic and farm animals, 
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and zoo, sports or pet animals, such as mouse, rabbit, pig, sheep, goat, cattle and higher 
primates. 

[0051] "Target cell" or "host cell" means a cell that is to be transformed using 
the methods and compositions of the invention. 

[0052] The term "pathogenic virus" is used herein to indicate a virus capable of 
infecting an animal. 

[0053] "Retroviruses" are viruses having an RNA genome. 

[0054] "Lentivirus" refers to a genus of retroviruses that are capable of infecting 
dividing and non-dividing cells. Several examples of lentiviruses include HTV (human 
immunodeficiency virus: including HIV type 1, and HTV type 2), the etiologic agent of the 
human acquired immunodeficiency syndrome (AIDS); visna-maedi, which causes 
encephalitis (visna) or pneumonia (maedi) in sheep, the caprine arthritis-encephalitis virus, 
which causes immune deficiency, arthritis, and encephalopathy in goats; equine infectious 
anemia virus, which causes autoimmune hemolytic anemia, and encephalopathy in horses; 
feline immunodeficiency virus (FIV), which causes immune deficiency in cats; bovine 
immune deficiency virus (BIV), which causes lymphadenopathy, lymphocytosis, and 
possibly central nervous system infection in cattle; and simian immunodeficiency virus 
(SIV), which cause immune deficiency and encephalopathy in sub-human primates. 

[0055] A "hybrid virus" as used herein refers to a virus having components 
from one or more other viral vectors, including element from non-retroviral vectors, for 
example, adenoviral-retroviral hybrids. As used herein hybrid vectors having a retroviral 
component are to be considered within the scope of the retroviruses. 

[0056] A lenti viral genome is generally organized into a 5' long terminal repeat 
(LTR), the gag gene, the pol gene, the env gene, the accessory genes (nef, vif, vpr, vpu) and 
a 3' LTR. The viral LTR is divided into three regions called U3, R and U5. The U3 region 
contains the enhancer and promoter elements. The U5 region contains the polyadenylation 
signals. The R (repeat) region separates the U3 and U5 regions and transcribed sequences 
of the R region appear at both the 5' and 3' ends of the viral RNA. See, for example, "RNA 
Viruses: A Practical Approach" (Alan J. Canr., Ed., Oxford University Press, (2000)), O 
Narayan and Clements J. Gen. Virology 70:1617-1639 (1989), Fields et al. Fundamental 
Virology Raven Press. (1990), Miyoshi H, Blomer U, Takahashi M, Gage FH, Verma IM. 
J Virol 72(10):8150-7 (1998), and U.S. Patent No. 6,013,516. 
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[0057] Lentiviral vectors are known in the art, including several that have been 
used to transfect hematopoietic stem cells. Such vectors can be found, for example, in the 
following publications, which are incorporated herein by reference: Evans JT et al. Hum 
Gene Ther 1999;10:1479-1489; Case SS, Price MA, Jordan CT et al. Proc Natl Acad Sci 
USA 1999;96:2988-2993; Uchida N, Sutton RE, Friera AM et al. Proc Natl Acad Sci USA 
1998;95:11939-11944; Miyoshi H, Smith KA, Mosier DE et al. Science 1999;283:682-686; 
Sutton RE, Wu HT, Rigg R et al. Human immunodeficiency virus type 1 vectors efficiently 
transduce human hematopoietic stem cells. J Virol 1998;72:5781-5788. 

[0058] "Virion," "viral particle" and "retroviral particle" are used herein to refer 
to a single virus comprising an RNA genome, pol gene derived proteins, gag gene derived 
proteins and a lipid bilayer displaying an envelope (glyco)protein. The RNA genome is 
usually a recombinant RNA genome and thus may contain an RNA sequence that is 
exogenous to the native viral genome. The RNA genome may also comprise a defective 
endogenous viral sequence. 

[0059] A "pseudotyped" retrovirus is a retroviral particle having an envelope 
protein that is from a virus other than the virus from which the RNA genome is derived. 
The envelope protein may be from a different retrovirus or from a non-retroviral virus. A 
preferred envelope protein is the vesicular stomatitius virus G (VSV G) protein. However, 
to eliminate the possibility of human infection, viruses can alternatively be pseudotyped 
with ecotropic envelope protein that limit infection to a specific species, such as mice or 
birds. For example, in one embodiment, a mutant ecotropic envelope protein is used, such 
as the ecotropic envelope protein 4.17 (Powell et al. Nature Biotechnology 18(12):1279- 
1282 (2000)). 

[0060] The term "provirus" is used to refer to a duplex DNA sequence present 
in a eukaryotic chromosome that corresponds to the genome of an RNA retrovirus. The 
provirus may be transmitted from one cell generation to the next without causing lysis or 
destruction of the host cell. 

[0061] A "self-inactivating 3' LIR" is a 3' long terminal repeat (LTR) that 
contains a mutation, substitution or deletion that prevents the LTR sequences from driving 
expression of a downstream gene. A copy of the U3 region from the 3* LTR acts as a 
template for the generation of both LTR's in the integrated provirus. Thus, when the 3' 
LTR with an inactivating deletion or mutation integrates as the 5' LTR of the provirus, no 
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transcription from the 5* LTR is possible. This eliminates competition between the viral 
enhancer/promoter and any internal enhancer/promoter. Self-inactivating 3' LTRs are 
described, for example, in Zufferey et al. /. Virol. 72:9873-9880 (1998), Miyoshi et al. J. 
Virol. 72:8150-8157 and Iwakuma et al. Virology 261:120-132 (1999). 

[00621 The tenn "RNA interference or silencing" is broadly defined to include 
all posttranscriptional and transcriptional mechanisms of RNA mediated inhibition of gene 
expression, such as those described in P.D. Zamore Science 296, 1265 (2002). 

[0063] "Substantial complementarity" and "substantially complementary" as 
used herein indicate that two nucleic acids are at least 80% complementary, more 
preferably at least 90% complementary and most preferably at least 95% complementary 
over a region of more than about 15 nucleotides and more preferably more than about 19 
nucleotides. 

[0064] In one aspect of the invention, a recombinant retrovirus is used to deliver 
an RNA coding region of interest to a cell, preferably a mammalian cell. The cell may be a 
primary cell or a cultured cell. In one embodiment the cell is an oocyte or an embryonic 
cell, more preferably a one-cell embryo. In another embodiment the cell is a hematopoietic 
stem cell. The RNA coding region and any associated genetic elements are thus integrated 
into the genome of the host cell as a provirus. When the target cell is an embryo, the cell 
may then be allowed to develop into a transgenic animal by methods well known in the art. 

[0065] The recombinant retrovirus used to deliver the RNA coding region is 
preferably a modified lentivirus, and thus is able to infect both dividing and non-dividing 
cells. The recombinant retrovirus preferably comprises a modified lentiviral genome that 
includes an RNA coding region. Further, the modified lentiviral genome preferably lacks 
endogenous genes for proteins required for viral replication, thus preventing undesired 
replication, such as replication in the target cells. The required proteins are preferably 
provided in trans in the packaging cell line during production of the recombinant retrovirus, 
as described below. 

[0066] In another embodiment, the recombinant retrovirus used to deliver the 
RNA coding region is a modified Moloney virus, for example a Moloney Murine Leukemia 
Virus. In a further embodiment, the virus is a Nt urine Stem Cell Virus (Hawley, R. G., et 
al. (1996) Proc. Natl. Acad. Sci. USA 93:10297-10302; Keller, G., et al. (1998) Blood 
92:877-887; Hawley, R. G., et al. (1994) Gen* Ther. 1:136-138). The recombinant 

-14- 



WO 03/023015 



PCT/US02/29214 



retrovirus also can be a hybrid virus such as that described in Choi, JK; Hoanga, N; Vilardi, 
AM; Conrad, P; Emerson, SG; Gewirtz, AM. (2001) Hybrid HIV/MSCV LTR Enhances 
Transgene Expression of Lentiviral Vectors in Human CD34+ Hematopoietic Cells. Stem 
Cells 19, No. 3, 236-246. 

[0067] In one embodiment the transgene, preferably an RNA coding region, is 
incorporated into a viral construct that comprises an intact retroviral 5' LTR and a self- 
inactivating 3' LTR. The viral construct is preferably introduced into a packaging cell line 
that packages viral genomic RNA based on the viral construct into viral particles with the 
desired host specificity. Viral particles are collected and allowed to infect the host cell. 
Each of these aspects is described in detail below. 

The Viral Construct 

[0068] The viral construct is a nucleotide sequence that comprises sequences 
necessary for the production of recombinant viral particles in a packaging cell. In one 
embodiment the viral construct additionally comprises genetic elements that allow for the 
desired expression of a gene of interest in the host. 

[0069] Generation of the viral construct can be accomplished using any suitable 
genetic engineering techniques well known in the art, including, without limitation, the 
standard techniques of PCR, oligonucleotide synthesis, restriction endonuclease digestion, 
ligation, transformation, plasmid purification, and DNA sequencing, for example as 
described in Sambrook et al. (Molecular Cloning: A Laboratory Manual. Cold Spring 
Harbor Laboratory Press, N.Y. (1989)), Coffin et al. (Retroviruses. Cold Spring Harbor 
Laboratory Press, N.Y. (1997)) and "RNA Viruses: A Practical Approach" (Alan J. Cann, 
Ed., Oxford University Press, (2000)). 

[0070] The viral construct may incorporate sequences from the genome of any 
known organism. The sequences may be incorporated in their native form or may be 
modified in any way. For example, the sequences may comprise insertions, deletions or 
substitutions. In a preferred embodiment the viral construct comprises sequences from a 
lentivirus genome, such as the HIV genome or the SIV genome. In another preferred 
embodiment, the viral construct comprises sequences of a murine stem cell virus (MSCV). 

[0071] The viral construct preferably comprises sequences from the 5' and 3' 
LTRs of a lentivirus, a moloney murine leukemia virus, a murine stem cell virus or hybrids 
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thereof. In one embodiment, the viral construct comprises the R and U5 sequences from 
the 5* LTR of a lentivirus and an inactivated or self-inactivating 3* LTR from a lentivirus. 
The LTR sequences may be LTR sequences from any lentivirus from any species. For 
example, they may be LTR sequences from HIV, SIV, FIV or BIV. Preferably the LTR 
sequences are HIV LTR sequences. The virus also can incorporate sequences from MMV 
orMSCV. 

[0072] The viral construct preferably comprises an inactivated or self- 
inactivating 3' LTR. The 3* LTR may be made self-inactivating by any method known in 
the art. In one embodiment the U3 element of the 3' LTR contains a deletion of its 
enhancer sequence, preferably the TATA box, Spl and NF-kappa B sites. As a result of the 
self-inactivating 3* LTR, the pro virus that is integrated into the host cell genome will 
comprise an inactivated 5' LTR. 

[0073] Optionally, the U3 sequence from the lentiviral 5' LTR may be replaced 
with a promoter sequence in the viral construct. This may increase the titer of virus 
recovered from the packaging cell line. An enhancer sequence may also be included. Any 
enhancer/promoter combination that increases expression of the viral RNA genome in the 
packaging cell line may be used. In one such embodiment the CMV enhancer/promoter 
sequence is used (U.S. Patent No. 5,168,062; Karasuyama et al J. Exp. Med. 169:13 (1989). 

[0074] The viral construct also comprises a transgene. The transgene, may be 
any nucleotide sequence, including sequences that serve as markers for the provirus. 
Preferably the transgene comprises one or more RNA coding regions and/or one or more 
genes of interest. 

[0075] In the preferred embodiment the transgene comprises at least one RNA 
coding region. Preferably the RNA coding region is a DNA sequence that can serve as a 
template for the expression of a desired RNA molecule in the host cell. In one 
embodiment, the viral construct comprises two or more RNA coding regions. 

[0076] The viral construct also preferably comprises at least one RNA 
Polymerase EI promoter. The RNA Polymerase III promoter is operably linked to the RNA 
coding region and can also be linked to a termination sequence. In addition, more than one 
RNA Polymerase HI promoter may be incorporated. 

[0077] RNA polymerase IE promoters are well known to one of skill in the art. 
A suitable range of RNA polymerase HI promoters can be found, for example, in Paule and 
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White. Nucleic Acids Research., Vol 28, pp 1283-1298 (2000), which is hereby 
incorporated by reference in its entirety. The definition of RNA polymerase HI promoters 
also include any synthetic or engineered DNA fragment that can direct RNA polymerase m 
to transcribe its downstream RNA coding sequences. Further, the RNA polymerase HI (Pol 
HI) promoter or promoters used as part of the viral vector can be inducible. Any suitable 
inducible Pol HI promoter can be used with the methods of the invention. Particularly 
suited Pol m promoters include the tetracycline responsive promoters provided in Ohkawa 
and Taira Human Gene Therapy, Vol. 1 1, pp 577-585 (2000) and in Meissner et al Nucleic 
Acids Research, Vol. 29, pp 1672-1682 (2001), which are incorporated herein by reference. 

[0078] In one embodiment the viral construct further comprises a gene that 
encodes a protein that is desirably expressed in one or more of the target cells, for example, 
a reporter or marker protein. Preferably the gene of interest is located between the 5* LTR 
and 3' LTR sequences. Further, the gene of interest is preferably in a functional 
relationship with other genetic elements, for example transcription regulatory sequences 
such as promoters and/or enhancers, to regulate expression of the gene of interest in a 
particular manner once the gene of interest is incorporated into the target cell genome. In 
certain embodiments, the useful transcriptional regulatory sequences are those that are 
highly regulated with respect to activity, both temporally and spatially. 

(0079] Preferably the gene of interest is in a functional relationship with an 
internal Polymerase II promoter/enhancer regulatory sequences. An "internal" 
promoter/enhancer is one that is located between the 5* LTR and the 3* LTR sequences in 
the viral construct and is operably linked to the gene that is desirably expressed. 

[0080] The Polymerase II promoter/enhancer may be any promoter, enhancer or 
promoter/enhancer combination known to increase expression of a gene with which it is in 
a functional relationship. A "functional relationship" and "operably linked" mean, without 
limitation, that the transgene or RNA coding region is in the correct location and 
orientation with respect to the promoter and/or enhancer that expression of the gene will be 
affected when the promoter and/or enhancer is contacted with the appropriate molecules. 

[0081] In another embodiment, the gene of interest is a gene included for safety 
concerns to allow for the selective killing of the treated target cells within a heterogeneous 
population, for example within an animal, or more particularly within a human patient. In 
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one such embodiment, the gene of interest is a thymidine kinase gene (TK) the expression 
of which renders a target cell susceptible to the action of the drug gancyclovir. 

[0082] In addition, more than one gene of interest may be placed in functional 
relationship with the internal promoter. For example a gene encoding a marker protein may 
be placed after the primary gene of interest to allow for identification of cells that are 
expressing the desired protein. In one embodiment a fluorescent marker protein, preferably 
green fluorescent protein (GFP), is incorporated into the construct along with the gene of 
interest. If a second reporter gene is included, an internal ribosomal entry site (IRES) 
sequence is also preferably included (U.S. Patent No. 4,937,190). The IRES sequence may 
facilitate the expression of the reporter gene. 

[0083] The viral construct may also contain additional genetic elements. The 
types of elements that may be included in the construct are not limited in any way and will 
be chosen by the skilled practitioner to achieve a particular result. For example, a signal 
that facilitates nuclear entry of the viral genome in the target cell may be included. An 
example of such a signal is the HIV-1 flap signal. 

[0084] Further, elements may be included that facilitate the characterization of 
the provirus integration site in the genome of the animal. For example, a tRNA amber 
suppressor sequence maybe included in the construct. 

[0085] In addition, the construct may contain one or more genetic elements 
designed to enhance expression of the gene of interest. For example, a woodchuck hepatitis 
virus responsive element (WRE) may be placed into the construct (Zufferey et al. J. Virol 
74:3668-3681 (1999); Deglon et al. Hum. Gene Ther. 11:179-190 (2000)). 

[0086] A chicken 0-globin insulator (Chung et al. Proc. Natl Acad. Scl USA 
94:575-580 (1997)) may also be included in the viral construct. This element has been 
shown to reduce the chance of silencing the integrated provirus in a target cell due to 
methylation and heterochromatinization effects. In addition, the insulator may shield the 
internal enhancer, promoter and exogenous' gene from positive or negative positional 
effects from surrounding DNA at the integration site on the chromosome. 

[0087] Any additional genetic elements are preferably inserted 3* of the gene of 
interest or RNA coding region. 

[0088] In a specific embodiment, the viral vector comprises: an RNA pol EI 
promoter sequence; the R and U5 sequences from the HIV 5' LTR; the HIV-1 flap signal; 
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an internal enhancer; an internal promoter; a gene of interest; the woodchuck hepatitis virus 
responsive element; a tRNA amber suppressor sequence; a U3 element with a deletion of its 
enhancer sequence; the chicken p-globin insulator; and the R and U5 sequences of the 3' 
HIVLTR. 

[0089] The viral construct is preferably cloned into a plasmid that may be 
transfected into a packaging cell line. The preferred plasmid preferably comprises 
sequences useful for replication of the plasmid in bacteria. 

[0090] Schematic diagrams of exemplary retroviral constructs are shown in 
Figures 1A and IB. 

Production of Virus 

[0091] Any method known in the art may be used to produce infectious 
retroviral particles whose genome comprises an RNA copy of the viral construct described 
above. 

[0092] Preferably, the viral construct is introduced into a packaging cell line. 
The packaging cell line provides the viral proteins that are required in trans for the 
packaging of the viral genomic RNA into viral particles. The packaging cell line may be 
any cell line that is capable of expressing retroviral proteins. Preferred packaging cell lines 
include 293 (ATCC CCL X), HeLa (ATCC CCL 2), D17 (ATCC CCL 183), MDCK 
(ATCC CCL 34), BHK (ATCC CCL-10) and Cf2Th (ATCC CRL 1430). The most 
preferable cell line is the 293 cell line. 

[0093] The packaging cell line may stably express the necessary viral proteins. 
Such a packaging cell line is described, for example, in U.S. Patent No. 6,218,181. 
Alternatively a packaging cell line may be transiently transfected with plasmids comprising 
nucleic acid that encodes the necessary viral proteins. 

[0094] In one embodiment a packaging cell line that stably expresses the viral 
proteins required for packaging the RNA genome is transfected with a plasmid comprising 
the viral construct described above. 

[0095] In another embodiment a packaging cell line that does not stably express 
the necessary viral proteins is co-transfected with two or more plasmids essentially as 
described in Yee et al. {Methods Cell Biol 43A, 99-112 (1994)). One of the plasmids 
comprises the viral construct comprising the RNA coding region. The other plasmid(s) 
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comprises nucleic acid encoding the proteins necessary to allow the cells to produce 
functional virus that is able to infect the desired host cell. 

[0096] The packaging cell line may not express envelope gene products. In this 
case the packaging cell line will package the viral genome into particles that lack an 
envelope protein. As the envelope protein is responsible, in part, for the host range of the 
viral particles, the viruses are preferably pseudotyped. Thus the packaging cell line is 
preferably transfected with a plasmid comprising sequences encoding a membrane- 
associated protein that will permit entry of the virus into a host cell. One of skill in the art 
will be able to choose the appropriate pseudotype for the host cell that is to be used. For 
example, in one embodiment the viruses are pseudotyped with the vesicular stomatitis virus 
envelope glycoprotein (VSVg). In addition to conferring a specific host range this 
pseudotype may permit the virus to be concentrated to a very high titer. Viruses can 
alternatively be pseudotyped with ecotropic envelope proteins that limit infection to a 
specific species, such as mice or birds. For example, in another embodiment, a mutant 
ecotropic envelope protein is used, such as the ecotropic envelope protein 4.17 (Powell et 
al. Nature Biotechnology 18(12):1279-1282 (2000)). 

[0097] In the preferred embodiment a packaging cell line that does not stably 
express viral proteins is transfected with the viral construct, a second vector comprising the 
HTV-1 packaging vector with the env 9 nef 9 5'LTR, 3'LTR and vpu sequences deleted, and a 
third vector encoding an envelope glycoprotein. Preferably the third vector encodes the 
VSVg envelope glycoprotein. 

[0098] In another embodiment of invention, RNA interference activity of the 
packaging cells is suppressed to improve the production of recombinant virus. This 
includes, without limitation, the use of cotransfection or stable transfection of constructs 
expressing siRNA molecules to inhibit Dicer, an RNase HI family member of ribonuclease 
which is essential for RNA interference (Hammond et al. Nat. Rev. Genet. 2:110-119 
(2001)). 

[0099] The recombinant virus is then preferably purified from the packaging 
cells, titered and diluted to the desired concentration. 
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Delivery of the Virus 

[0100] The virus may be delivered to the cell in any way that allows the virus to 
infect the cell. Preferably the virus is allowed to contact the cell membrane. A preferred 
method of delivering the virus to mammalian cells is through direct contact. 

[0101 J In one embodiment, the target cells are preferably contacted with the 
virus in culture plates. The virus may be suspended in media and added to the wells of a 
culture plate. The media containing the virus may be added prior to the plating of the cells 
or after the cells have been plated. Preferably cells are incubated in an appropriate amount 
of media to provide viability and to allow for suitable concentrations of virus in the media 
such that infection of the host cell occurs. 

[0102] The cells are preferably incubated with the virus for a sufficient amount 
of time to allow the virus to infect the cells. Preferably the cells are incubated with virus 
for at least 1 hour, more preferably at least 5 hours and even more preferably at least 10 
hours. 

[0103] In any such embodiments, any concentration of virus that is sufficient to 
infect the cell may be used. When the target cell is to be cultured, the concentration of the 
viral particles is at least 1 pfu/|il, more preferably at least 1 0 pfii/jal, even more preferably at 
least 400 pfii/nl and even more preferably at least 1 x 10 4 pfu/p.1. 

[0104] Following infection with the virus, the cells can be introduced into an 
animal. The location of introduction of cultured cells will depend on the cell type used. 
For example, when the cells are hematopoietic cells, the cells can be introduced into the 
peripheral blood stream. The cells introduced into an animal are preferably cells derived 
from that animal, to avoid an adverse immune response. Cells also can be used that are 
derived from a donor animal having a similar immune makeup. Other cells also can be 
used, including those designed to avoid an immunogenic response. 

[0105] In another embodiment, a suitable amount of virus is introduced into an 
animal directly, for example though injection into the body. In one such embodiment, the 
viral particles are injected into the animal's peripheral blood stream. Other injection 
locations also are suitable. Depending on the type of virus, introduction can be carried out 
through other means including for example, inhalation, or direct contact with epithelial 
tissues, for example those in the eye, mouth or skin. 
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[0106] The cells and animals incorporating introduced cells may be analyzed, 
for example for integration of the RNA coding region, the number of copies of the RNA 
coding region that integrated, and the location of the integration. Such analysis may be 
carried out at any time and may be carried out by any methods known in the art. Standard 
techniques are described, for example, in Hogan et al. (supra). 

[0107] The methods of infecting cells disclosed above do not depend upon 
species-specific characteristics of the cells. As a result, they are readily extended to all 
mammalian species. 

[0108] As discussed above, the modified retrovirus can be pseudotyped to 
confer upon it a broad host range. One of skill in the art would also be aware of appropriate 
internal promoters to achieve the desired expression of a gene of interest in a particular 
animal species. Thus, one of skill in the art will be able to modify the method of infecting 
cells derived from any species. 

Down-regulating Gene Expression in a Target Cell 

[0109] The methods described herein allow the expression of RNA molecules in 
cells, and are particularly suited to the expression of small RNA molecules, which can not 
be readily expressed from a Pol II promoter. According to a preferred embodiment of the 
invention, an RNA molecule is expressed within a cell in order to down-regulate the 
expression of a target gene. The ability to down-regulate a target gene has many 
therapeutic and research applications, including identifying the biological functions of 
particular genes. Using the techniques and compositions of the invention, it will be 
possible to knock-down (or down-regulate) the expression of a large number of genes, both 
in cell culture and in mammalian organisms. In particular, it is desirable to down-regulate 
genes in a target cell that are necessary for the life cycle of a pathogen, such as a pathogenic 
virus. 

[0110] In preferred embodiments of the invention, an RNA expression cassette 
comprises a Pol HI promoter and an RNA coding region. The RNA coding region 
preferably encodes an RNA molecule that is capable of down-regulating the expression of a 
particular gene or genes. The RNA molecule encoded can, for example, be complementary 
to the sequence of an RNA molecule encoding a gene to be down-regulated. In such an 
embodiment, the RNA molecule is designed to act through an antisense mechanism. 
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[0111] A more preferred embodiment involves the expression of a double- 
stranded RNA complex, or an RNA molecule having a stem-loop or a so-called "hairpin" 
structure. As used herein, the term "RNA duplex" refers to the double stranded regions of 
both the RNA complex and the double-stranded region of the hairpin or stem-lop structure. 
An RNA coding region can encode a single stranded RNA, two or more complementary 
single stranded RNAs or a hairpin forming RNA. 

[0112] Double stranded RNA has been shown to inhibit gene expression of 
genes having a complementary sequence through a process termed RNA interference or 
suppression (see, for example, Hammond et al. Nat. Rev. Genet. 2:1 10-1 19 (2001)). 

[01 13] According to the invention, the RNA duplex or siRNA corresponding to 
a region of a gene to be down-regulated is expressed in the cell. The RNA duplex is 
substantially identical (typically at least about 80% identical, and more typically at least 
about 90% identical) in sequence to the sequence of the gene targeted for down regulation. 
siRNA duplexes are described, for example, in Bummelkamp et al. Science 296:550-553 
(2202), Caplen et al. Proc. Natl. Acad. Sci. USA 98:9742-9747 (2001) and Paddison et al. 
Genes &Devel. 16:948-958(2002). 

[0114] The RNA duplex is generally at least about 15 nucleotides in length and 
is preferably about 15 to about 30 nucleotides in length. In some organisms, the RNA 
duplex can be significantly longer. In a more preferred embodiment, the RNA duplex is 
between about 19 and 22 nucleotides in length. The RNA duplex is preferably identical to 
the target nucleotide sequence over this region. 

[0115] When the gene to be down regulated is in a family of highly conserved 
genes, the sequence of the duplex region can be chosen with the aid of sequence 
comparison to target only the desired gene. If there is sufficient identity among a family of 
homologous genes within an organism, a duplex region can be designed that would down 
regulate a plurality of genes simultaneously. 

[0116] The duplex RNA can be expressed in a cell from a single retroviral 
construct. In the preferred embodiment, a single RNA coding region in the construct is a 
serves as a template for the expression of a self-complementary hairpin RNA, comprising a 
sense region, a loop region and an antisense region. This embodiment is illustrated in 
Figure 2, which shows a schematic view of an RNA expression cassette having an RNA Pol 
m promoter 100 operatively linked to an RNA coding region, having a sense region 110, a 
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loop region 120, an antisense region 130 and a terminator region 140. The sense 110 and 
antisense 130 regions are each preferably about 15 to about 30 nucleotides in length. The 
loop region 120 preferably is about 2 to about 15 nucleotides in length, more preferably 
from about 4 to about 9 nucleotides in length. Following expression the sense and 
antisense regions form a duplex. 

[0117] In another embodiment, the retroviral construct comprises two RNA 
coding regions. The first coding region is a template for the expression of a first RNA and 
the second coding region is a template for the expression of a second RNA. Following 
expression, the first and second RNA's form a duplex. The retroviral construct preferably 
also comprises a first Pol IQ promoter operably linked to the first RNA coding region and a 
second Pol HI promoter operably linked to the second RNA coding region. This 
embodiment is illustrated in Figure 3, which shows a schematic view of an RNA expression 
cassette having an RNA Polymerase ID promoter 100 linked to a first RNA coding region 
110 and a first terminator sequence 140 and a second RNA polymerase HI promoter 105 
linked to a second RNA coding region 115 and a second terminator 145. 

[0118] In yet another embodiment of the invention, the retroviral construct 
comprises a first RNA Pol EI promoter operably linked to a first RNA coding region, and a 
second RNA Pol JR promoter operably linked to the same first RNA coding region in the 
opposite direction, such that expression of the RNA coding region from the first RNA Pol 
HI promoter results in a synthesis of a first RNA molecule as the sense strand and 
expression of the RNA coding region from the second RNA Pol EI promoter results in 
synthesis of a second RNA molecule as an antisense strand that is substantially 
complementary to the first RNA molecule. In one such embodiment, both RNA 
Polymerase HI promoters are separated from the RNA coding region by termination 
sequences, preferably termination sequences having five consecutive T residues. Figure 4 
shows a schematic view of an RNA expression cassette having a first RNA Polymerase III 
promoter 100 linked to an RNA coding region 110 and a first terminator sequence 145. 
The expression cassette has a second RNA polymerase HI promoter 105 linked to the RNA 
coding region 115, the same sequence as 110 in reverse, and a second terminator 140. 

[0119] In further embodiments an RNA duplex is expressed using two or more 
retroviral constructs. In one embodiment, a first retroviral construct is used that directs the 
expression of a first RNA and a second retroviral construct is used that directs expression 
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of a second RNA that is complementary to the first. Following expression the first and 
second RNAs form a duplex region. It is preferred, however, that the entire duplex region 
is introduced using retroviral particles derived from a single retroviral construct. As 
discussed above, several strategies for expressing a duplex RNA from a single viral 
construct are shown in Figures 2-4. 

[0120] The RNA duplexes may be flanked by single stranded regions on one or 
both sides of the duplex. For example, in the case of the hairpin, the single stranded loop 
region would connect the duplex region at one end. 

[0121] The RNA coding region is generally operatively linked to a terminator 
sequence. The pol III terminators preferably comprise of stretches of 4 or more thymidine 
("T") residues. In a preferred embodiment, a cluster of 5 consecutive Ts is linked 
immediately downstream of the RNA coding region to serve as the terminator. In such a 
construct pol HI transcription is terminated at the second or third T of the DNA template, 
and thus only 2 to 3 uridine ("U") residues are added to the 3' end of the coding sequence. 

[0122] The sequence of the RNA coding region, and thus the sequence of the 
RNA duplex, preferably is chosen to be complementary to the sequence of a gene whose 
expression is to be downregulated in a cell or organism. The degree of down regulation 
achieved with a given RNA duplex sequence for a given target gene will vary by sequence. 
One of skill in the art will be able to readily identify an effective sequence. For example, in 
order to maximize the amount of suppression, a number of sequences can be tested in cell 
culture prior to treating a target cell or generating a transgenic animal. As an understanding 
of the sequence requirements for RNA interference is determined, the RNA duplex can be 
selected by one of skill in the art. 

Inhibition of Viral Replication and/or Gene Expression in a Target Cell 

[0123] According to one aspect of the invention, the target of the RNA duplex is 
a sequence that is necessary for the life cycle or replication of a virus, including for 
example, gene expression of the virus and the expression of a cellular receptor or co- 
receptor necessary for viral replication. In one embodiment of the invention, the virus to be 
inhibited is the human immunodeficiency virus (HIY). 

[0124] The invention also includes methods of treating a patient having a viral 
infection. In one embodiment the method comprises administering to the patient an 
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effective amount of a recombinant retroviral particle (or particles) encoding at least one 
double stranded RNA having at least 90% homology and preferably identical to a region of 
at least about 15 to 25 nucleotides in a nucleotide that is important for normal viral 
replication. For example, the double stranded RNA may have homology to a nucleic acid 
in a viral genome, a viral gene transcript or in a gene for a patient's cellular receptor that is 
necessary for the life cycle of the virus. 

[0125] In one embodiment, the patient to be treated is infected with the human 
immunodeficiency virus. A target cell is removed from a patient prior to treatment with the 
recombinant virus particle. In a preferred embodiment, the target cell is a CD34-positive 
hematopoietic stem cell. Such stem cells can be purified by one of skill in the art. Methods 
for such purification are known and taught for example in U.S.Patents: 4,965,204; 
4,714,680; 5,061,620; 5,643,741; 5,677,136; 5,716,827; 5,750,397 and 5,759,793. One 
method for purifying such CD34-positive stem cells involves centrifugation of peripheral 
blood samples to separate mononuclear cells and granulocytes and sorting by fluorescence 
activated cell sorting (FACS). Sorted cells can be enriched for CD34+ cells through any of 
the above techniques. In a particular embodiment, the cells are enriched for CD34+ cells 
through a magnetic separation technology such as that available from Miltenyi Biotec and 
described in the following publications: Kogler et al. (1998) Bone Marrow Transplant. 21: 
233-241; Pasino et al. (2000) Br. J. Haematol. 108: 793-800. The isolated CD34-positive 
stem cell is treated with a recombinant retroviral particle having an RNA coding region 
encoding a double stranded RNA directed against one or more targets within the viral 
genome and/or cellular targets that are necessary for the viral life cycle, including, for 
example, receptors or co-receptors necessary for entry of the pathogenic virus. The treated 
stem cells are then reintroduced into the patient. 

[0126] The methods of the invention can be used to treat a variety of viral 
diseases, including, for example, human immunodeficiency virus (HIV-1 and HIV-2), 
hepatitis A, hepatitis B, hepatitis C. 

[0127] It is also possible to treat a patient with an anti-viral recombinant 
retrovirus in order to confer immunity or increased lesistance for the patient to a desired 
pathogen, such as a virus. 

Cellular Targets 
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[0128] According to the invention, one of skill in the art can target a cellular 
component, either an RNA or an RNA encoding a cellular protein necessary for the 
pathogen life cycle, particularly a viral life cycle. In a preferred embodiment, the cellular 
target chosen will not be a protein or RNA that is necessary for normal cell growth and 
viability. Suitable proteins for disrupting the viral life cycle include, for example, cell 
surface receptors involved in viral entry, including both primary receptors and secondary 
receptors, and transcription factors involved in the transcription of a viral genome, proteins 
involved in integration into a host chromosome, and proteins involved in translational or 
other regulation of viral gene expression. 

[0129] A number of cellular proteins are known to be receptors for viral entry 
into cells. Some such receptors are listed in an article by E. Baranowski, CM. Ruiz-Jarabo, 
and E. Domingo, "Evolution of Cell Recognition by Viruses," Science 292: 1102-1105, 
which is hereby incorporated by reference in its entirety. Some cellular receptors that are 
involved in recognition by viruses are listed below: Adenoviruses: CAR, Integrins, MHC I, 
Heparan sulfate glycoaminoglycan, Siliac Acid; Cytomegalovirus: Heparan sulfate 
glycoaminoglycan; Coxsackieviruses: Integrins, ICAM-1, CAR, MHC I; Hepatitis A: 
murine-like class I integral membrane clycoprotein; Hepatitis C: CD81, Low density 
lipoprotein receptor; HIV (Retroviridae): CD4, CXCR4, Heparan sulfate 
glycoaminoglycan; HSV: Heparan sulfate glycoaminoglycan, PVR, HveB, HveC; 
Influenza Virus: Sialic acid; Measles: CD46, CD55; Poliovirus,: PVR, HveB, HveC; 
Human papillomavirus: Integrins. One of skill in the art will recognize that the invention is 
not limited to use with receptors that are currently known. As new cellular receptors and 
coreceptors are discovered, the methods of the : nvention can be applied to such sequences. 

Human Immunodeficiency Virus (HTV) 
HIV viral targets: 

[0130] In one embodiment of the invention, the retroviral construct has an RNA 
coding region that encodes a double stranded molecule having at least 90% homology to the 
HIV viral RNA genome, an expressed region o f the HIV viral genome, for example, to any 
region of about 19-25 nucleotides in length of the 9-kb transcript of the integrated HIV 
virus, or any of the variously spliced mRNA transcripts of HIV (Schwartz, S; Felber, BK; 
Benko, DM; Fenya, EM; Pavlakis, GN. Cloning and functional analysis of multiply spliced 
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mRNA species of human immunodeficiency virus type 1. J. Virol 1990; 64(6): 2519-29). 
Target regions within the HIV transcripts can be chosen to correspond to any of the viral 
genes, including, for example, HIV-1 LTR, vif y nef, and rev. In another embodiment, the 
RNA coding region encodes a double stranded region having at least 90% homology to a 
receptor or co-receptor of the HIV virus. For example, the primary receptor for HIV entry 
into T cells is CD4. In a preferred embodiment, the co-receptors CXC chemokine receptor 
4 (CXCR4) and CC chemokine receptor 5 (CCR5) are down-regulated according to the 
methods of the invention. CXCR4 (Feddersppiel et al. Genomics 16:707-712 (1993)) is 
the major co-receptor for T cell trophic strains of HIV while CCR5 (Mummidi et al. J. Biol. 
Chem. 272:30662-30671 (1997)) is the major co-receptor for macrophage trophic strains of 
HIV. Other cellular targets against HIV include the RNA transcripts for proteins involved 
in the HIV life cycle, including cyclophilin, CRM-1, importin-6, HP68 (Zimmerman C, et 
al. Identification of a host protein essential for assembly of immature HIV-1 capsids. 
Nature 415 (6867): 88-92 (2002)) and other as yet unknown cellular factors. 

[0131 J The following examples are offered for illustrative purposes only, and 
are not intended to limit the scope of the present invention in any way. Indeed, various 
modifications of the invention in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description and fall within the scope 
of the appended claims. 

[0132] All patent and literature references cited in the present specification are 
hereby incorporated by reference in their entirety. 

Examples 

Example 1 

[0133] According to this example, an siRNA lentiviral construct against lacZ 
gene was constructed by insertion of the siRNA expression cassette into the Pad site of 
HC-FUGW vector (Figure 5). HC-FUGW vector (SEQ ID NO: 3) contains a GFP marker 
gene driven by human Ubiquitin promoter for tracking transduction events. The vector also 
contains an HTV DNA Flap element to improve the virus titets, and WPRE for high level 
expression of viral genes. The siRNA expression cassette is co/nposed of a pol m promoter 
and a small hairpin RNA coding region followed by a pol ID terminator site. The pol IE 
promoter is derived from r 240 to -8 region of human HI -RNA promoter and is connected 
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to the downstream RNA coding region through a 7 base pair linker sequence to ensure that 
the transcription is precisely initiated at the first nucleotide of the RNA coding sequence. 
The small hairpin RNA coding region contains a 19 nt sequence corresponding to 1915- 
1933 region of the sense strand of lacZ gene coding sequence and the 19 nt perfect reverse 
complementary sequence separated by a 9 nt loop region. The terminator is comprised of 5 
consecutive thymidine residues linked immediately downstream of the RNA coding 
sequence. 

Example 2 

[0134] This example demonstrates the transduction of cultured mammalian cells 
with a retroviral vector (Figure 6). The retroviral vector encoding a small hairpin RNA 
molecule described in Example 1, was used to transfect cultured mammalian cells that 
express lacZ, and caused a profound decrease in the expression of the test gene lacZ. The 
lacZ siRNA virus was produced by cotransfection of the retroviral vector, a helper virus 
plasmid and VSVg expression plasmid in HEK293 cells. The virus particles were 
harvested from the cell culture supernatants and concentrated by ultracentrifugation. The 
concentrated virus preparations were used to infect either mouse embryonic fibroblasts 
(MEF) or HEK293 cells which harbor both lacZ and firefly luciferase (Luc) reporter genes. 
Infection was monitored by the GFP signal which is expressed from the marker gene 
cassette of the viral vector. Under the conditions of this experiment, >98% of the test cells 
were GPF+ and thus successfully transduced. The expression levels of lacZ and Luc 
reporter genes were measured by chemiluminescent assays using commercially available 
kits (lacZ assay kit from Roche and Luc from Promega). The lacZ siRNA virus only 
inhibited the expression of lacZ but not Luc. The specific inhibition was determined by the 
ration of lacZ activity versus Luc activity. The lacZ/Luc ration of the uninfected parental 
cells was arbitrarily set to 1 and the values of the infected cells were calculated accordingly. 
As shown in Figure 6, transfection with the virus resulted in dramatic reduction in the 
amount of expression of the lacZ gene in both MEK and HEK293 cells. 

[0135] A tet-iducible lacZ siRN/ lentiviral vector was also prepared as 
illustrated in Figure 8. A Tet repressor gene ( etR; SEQ ID NO: 7) was placed the under 
the control of the human UbiquitinC promoter so that its expression could be monitored by 
the downstream GFP marker. The anti-lacZ siRNA cassette was driven by a Tet-inducible 
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pol m promoter derived from human U6-promoter (-328 to +1) containing a single TetR 
binding site (TetOl) between the PSE and TATA box (SEQ ID NO: 6). The TetR coding 
sequence was PCR amplified from genomic DNA from the TOP10 strain of E. coli adn 
cloned into a modified version of FUIGW as a Bgl2-EcoRl fragment. In the absence of 
tetracycline, TetR binds to the promoter and its expression is repressed. Upon the addition 
of tetracycline, TetR is moved from the promoter and transcription starts. 

[0136] The Tet-inducible siRNA expression cassette was able to regulate gene 
expression in response to Doxycycline treatment. Virus was prepared from the retroviral 
construct carrying the Tet-inducible lacZ-siRNA cassette and a Tet repressor under the 
control of a UbiquitinC promoter and used to transduce HEK293 cells expressing both lacZ 
and luciferase (293Z+Luc). The transduced cells were treated with 10 ug/ml Doxycycline 
(Plus Dox) for 48hr or without the Doxycycline treatment as a control (No Dox). LacZ and 
luciferase activities were measured as described in the previous figures. The relative 
suppression activity is calculated as the ratio of lacZ versus luciferase and No Dox control 
was arbitrarily set to 1. As can be seen in Figure 9, in the presence of doxycycline 
suppression of lacZ activity was significantly enhanced. 

Example 3 

[0137] This example demonstrates the generation of transgenic animals that 
express an siRNA molecule encoded by a lentiviral vector. The expression of the lacZ 
specific siRNA construct described in Example 1 resulted in extensive suppression of lacZ 
activity in ROSA26+/- mice. ROSA26+/- animals carry one copy of an ubiquitously 
expressed lacZ reporter gene. The lacZ siRNA virus preparations described in Example 2 
were used for perivitelline injection of ROSA26+/- single cell embryos obtained from 
hormone primed C57B1/6 female donors x ROSA26+/+ stud males. The injected single 
cell embryos were subsequently transferred into the oviduct of timed pseudopregnant 
female recipients. Embryonic day 15.5 to 17.5 (E15.5-17.5) fetuses were recovered from 
the surrogate mothers. Successful transgenesis was scored by positive GFP signal observed 
with the fetuses under fluorescent microscope. Protein extracts prepared from the limb 
tissues of the fetuses were used for the LacZ chemi luminescent assay according to the 
manufacturer's instruction (Roche), and protein concentrations of the tissue extracts were 
determined by the Bradford assay (BioRad). The lacZ expression levels were expressed as 
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light units (LU) per ug of proteins (LU/ug). The E15.5-17.5 fetuses from the timed mating 
of C57B1/6 females x ROSA26+/+ males and C57B1/6 females x C57B1/6 males were 
served as the positive and negative controls respectively. The results are shown in Figure 7. 
In animals G1-G4 (those treated with lentiviral vetor encoding the siRNA construct), the 
animals showed markedly decreased expression of the lacZ gene as compared with 
untreated control animals. 

Example 4 

[0138] A lentiviral construct comprising an RNA coding region encoding an 
anti-human CCR5 siRNA was prepared. As illustrated in Figure 10, the vector comprised 
an HIV based lentiviral vector 5' LTR, an HIV Flap element, a human U6-RNA pol II 
promoter (-328 to +1; SEQ ID NO: 4), a human CCR5 specific short hairpin RNA cassette, 
an internal ubiquitin promoter, a GFP marker gene operably linked to the ubiquitin 
promoter a WRE regulatory element and an HIV based self-inactivating lentiviral 3' LTR. 
The structure and sequence of the anti-CCR5 siRNA are provided in Figure 10 and SEQ ED 
NO: 1. 

(0139J Recombinant retrovirus was prepared from the anti-CCR5 siRNA vector 
construct as described above. Human MAGI-CCR5 cells (Deng et al., Nature 381:661 
(1996)) were infected with the recombinant virus or a retrovirus encoding a non-specific 
control siRNA and cell surface expression of CCR5 was measured by flow cytometric 
analysis. Relative expression levels were calculated by mean fluorescence intensity. As 
can be seen in Figure 11, the anti-CCR5 siRNA reduced the level of CCR5 expression 
almost completely, while the non-specific siRNA did not suppress expression at all. 

Example 5 

[0140] . A further anti-HIV-1 siRNA encoding lentiviral vector was constructed, 
as illustrated in Figure 12. This vector comprised an RNA coding region encoding an anti- 
HIV-1 Rev gene specific short hairpin siRNA (SEQ ID NO: 2). The anti-HIV-1 Rev 
siRNA targeted the 8420 to 8468 region of the Rev coding of HIV- 1 (nucleotide coordinate 
of NL4-3 strain; Salminen et al. Virology 212:80-86 (1995)). The sequence and structure 
of the siRNA coding region are illustrated in Figure 12 as well. Expression of the anti- 
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HIV-1 Rev siRNA was driven by a human Hl-RNA pol IH promoter (-240 to -9; SEQ ID 
NO: 5). 

[0141] The ability of the anti-HIV-1 Rev siRNA to suppress HTV transcription 
in human cells was measured. The transcriptional activity of HIV-1 was measured based 
on the activity of a firefly luciferase reporter gene inserted at the env/nef region, essentially 
as described in Li et al. J. Virol 65:3973 (1991)). 

[0142] Recombinant retrovirus was prepared from the vector construct as 
described above and used to infect human cells comprising HIV-1 with the reporter 
construct. The luciferase activity of untransduced parental cells was arbitrarily set to 1 and 
the relative HIV transcription levels of the transduced cells were calculated accordingly. A 
non-specific siRNA was used as a control. 

[0143] As can be seen in Figure 13, HIV-1 transcription was significantly 
suppressed in cells infected with the recombinant retrovirus comprising the anti-HIV-1 Rev 
siRNA coding region, while the non-specific siRNA had no significant effect. 

Example 6 

[0144] According to this example, an siRNA lentiviral construct against the 
HTV genome is constructed by insertion of an siRNA expression cassette into the Pad site 
of HC-FUGW vector. HC-FUGW vector contains a GFP marker gene driven by human 
Ubiquitin promoter for tracking transduction events. The vector also contains an HIV DNA 
Flap element to improve the virus titers, and WPRE for high level expression of viral 
genes. The siRNA expression cassette is composed of a pol III promoter and a small 
hairpin RNA coding region followed by a pol HI terminator site. The pol m promoter is 
derived from -240 to -8 region of human Hl-RNA promoter and is connected to the 
downstream RNA coding region through a 7 base pair linker sequence to ensure that the 
transcription is precisely initiated at the first nucleotide of the RNA coding sequence. The 
small hairpin RNA coding region contains a 21 nt sequence corresponding to a region of 
the CCR5 coding sequence and the 21 nt perfect reverse complementary sequence separated 
by a 4 nt loop region. The terminator is comprised of 5 consecutive thymidine residues 
linked immediately downstream of the RNA coding sequence.. 
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[0145) The retroviral construct is used to transfect a packaging cell line 
(HEK293 cells) along with a helper virus plasmid and VSVg expression plasmid. The 
recombinant viral particles are recovered. 

[0146] CD34-positive hematopoietic stem cells are isolated from a patient's 
bone marrow using a immunomagnetic approach (see, for example, Choi et al.(1995) Blood 
85:402-413; Fehse et al. (1997) Human Gene Therapy 8:1815-1824; Di Nicola et al.(1996) 
Bone Marrow Transplant. 18:1117-1121; Servida et al.(1996) Stem Cells 14:430-438; de 
Wynter et al.(1995) Stem Cells 13:524-532; Ye et al.(1996) Bone Marrow Transplant. 
18:997-1008.). The cells are cultured and treated with the recombinant virus particles. The 
infected cells are sorted by FACS based on expression of GFP. Those cells expressing GFP 
are reintroduced into a patient by injection. 

Example 7 

[0147] According to this example, an siRNA lentiviral construct against the 
HIV genome is constructed by insertion of an siRNA expression cassette into the Pad site 
of HC-FUGW vector. The siRNA expression cassette comprises a human HI promoter 
operatively linked to an RNA coding region encoding an anti-HTVM Rev gene specific 
short hairpin siRNA. The siRNA expression cassette additionally comprises a pol EI 
promoter operatively linked to a small anti-CCR5 hairpin RNA. The retroviral construct is 
illustrated in Figure 14. 

[0148] The retroviral construct is used to transfect a packaging cell line 
(HEK293 cells) along with a helper virus plasmid and VSVg expression plasmid. The 
recombinant viral particles are recovered. 

[0149] CD34-positive hematopoietic stem cells are isolated from a patient's 
bone marrow using a immunomagnetic approach (see, for example, Choi et al.(1995) Blood 
85:402-413; Fehse et al. (1997) Human Gene Therapy 8:1815-1824; Di Nicola et al.(1996) 
Bone Marrow Transplant. 18:1117-1121; Servida et al.(1996) Stem Cells 14:430-438; de 
Wynter et al.(1995) Stem Cells 13:524-532; Ye et al.(1996) Bone Marrow Transplant. 
18:997-1008.). The cells are cultured and treated with the recombinant virus particles. The 
infected cells are sorted by FACS based on exp ression of GFP. Those cells expressing GFP 
are reintroduced into a patient by injection. 
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WHAT IS CLAIMED IS: 

1. A method of expressing an RNA molecule within a cell, the method 
comprising: 

transfecting a packaging cell line with a retroviral construct; 

recovering a recombinant retrovirus from the packaging cell line; and 

infecting a target cell with the recombinant retrovirus, 

wherein the recombinant retrovirus construct comprises a first RNA 
polymerase JR promoter region, a first RNA coding region, and a first termination 
sequence, and 

wherein the RNA coding region comprises a sequence that is at least about 
90% identical to a target region of a pathogenic virus genome or genome transcript 
or a target cell gene involved in the pathogenic virus life cycle. 

2. The method of claim 1, wherein the retroviral construct further comprises 
the R and U5 sequences from a 5' lenti viral long terminal repeat (LTR) and a self- 
inactivating lentiviral 3' LTR. 

3. The method of claim 1, wherein the RNA coding region encodes an RNA 
molecule having a sense region, an antisense region and a loop region, and wherein the 
sense region is substantially complementary to the antisense region. 

4. The method of claim 3> wherein the loop region is about 2 to about 15 
nucleotides in length. 

5. The method of claim 1, wherein the target region is between about 15 and 
about 30 nucleotides in length. 

6. The method of claim 5, wherein the target region is from 18 to 23 
nucleotides in length. ? 

7. The method of claim 1, wherein the pathogenic virus is human 
immunodeficiency virus. 

8. The method of claim 6, wherein the target region is a region of a human 

gene transcript selected from the group consisting of CCR5 and CXCR4. 
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9. The method of claim 6, wherein the target region is a region of the human 
immunodeficiency virus transcript. 

10. The method of claim 1, wherein the pathogenic virus is selected from the 
group consisting of hepatitis C virus (HCV), cytomegalovirus (CMV), herpes simplex 
virus (HSV), influenza virus, an adenovirus and poliovirus. 

1 1 . The method of claim 1 , wherein the first RNA coding region encodes a first 
RNA molecule, and the retroviral construct further comprises a second RNA polymerase EI 
promoter and a second RNA coding region operably linked to the second RNA polymerase 
lH promoter, wherein the second RNA coding region encodes a second RNA molecule 
substantially complementary to the first RNA molecule at the target region. 

12. The method of claim 1, wherein the retroviral construct further comprises a 
second RNA polymerase IE promoter region operably linked to the first RNA coding 
region, such that expression of the first RNA coding region from the first RNA polymerase 
HI promoter results in a synthesis of a first RNA molecule and expression of the first RNA 
coding region from the second RNA polymerase IQ promoter results in synthesis of a 
second RNA molecule substantially complementary to the first RNA molecule at the target 
region. 

13. The method of claim 1 wherein said packaging cell line is an HEK293 cell 

line. 

14. The method of claim 1 wherein the 5' LTR sequences are from HIV. 

15. The method of claim 1 wherein the self-inactivating 3' LTR comprises a U3 
element with a deletion of its enhancer sequence. 

16. The method of claim 15, wherein the self-inactivating 3' LTR is a modified 
HIV 3' LTR. 

1 7. The method of claim 1 , wherein the recombinant retrovirus is pseudotyped. 
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18. The method of claim 17, wherein the recombinant retrovirus is pseudotyped 
with the vesicular stomatitits virus envelope glycoprotein. 

19. The method of claim 1 wherein the 5' LTR sequences are from Moloney 
Murine Leukemia Virus. 

20. The method of claim 1 wherein the 5'LTR sequences are from murine stem 
cell virus (MSCV). 

2 1 . The method of claim 1 , wherein the target cell is a human cell. 

22. The method of claim 2 1 , wherein the target cell is a hematopoietic cell. 

23. The method of claim 22, wherein the target cell is a CD34-positive 
hematopoietic cell. 

24. The method of claim 23, further comprising isolating the target CD34- 
positive hematopoietic cells from a patient. 

25. The method of claim 24, further comprising reintroducing the infected 
CD34-positive hematopoietic into the patient. 

26. The method of claim 1, wherein the cell is a cultured cell. 

27. The method of claim 1, wherein the cell is a human cell in vivo. 

28. A method of treating a patient infected with HIV, the method comprising: 
isolating a CD34-positive target cell from a patient; 

infecting the target cell with a recombinant retrovirus recovered from a 
packaging cell line transfected with a retroviral construct, 

wherein the recombinant retroviral construct comprises a first RNA 
polymerase EI promoter region, at least one RNA coding region, and at least one 
termination sequence, and 



-36- 



WO 03/023015 



PCT/US02/29214 



wherein the RNA coding region has a sequence that is at least about 90% 
identical to a target region of the HIV genome, an HIV genome transcript or another 
gene involved in the virus life cycle. 

29. The method of claim 28, wherein the retroviral construct further comprises 
. the R and U5 sequences from a 5* lentiviral long terminal repeat (LTR) and a self- 
inactivating lenti viral 3' LTR. 

30. The method of claim 28, wherein the RNA coding region encodes an RNA 
molecule having a sense region, an antisense region and a loop region, wherein the sense 
region is substantially complementary to the antisense region. 

31. The method of claim 28, wherein the RNA coding region encodes a first 
RNA molecule, and the retroviral construct further comprises a second RNA polymerase in 
promoter and a second RNA coding region operably linked to the second RNA polymerase 
HI promoter, wherein the second RNA coding region encodes a second RNA molecule 
substantially complementary to the first RNA molecule at the target region. 

32. The method of claim 28, wherein the retroviral construct further comprises a 
second RNA polymerase HI promoter region operably linked to the RNA coding region, 
such that expression of the RNA coding region from the first RNA polymerase HI promoter 
results in a synthesis of a first RNA molecule and expression of the RNA coding region 
from the second RNA polymerase HI promoter results in synthesis of a second RNA 
molecule substantially complementary to the first RNA molecule at the target region. 

33. The method of claim 28, wherein the target region is from 18 to 23 
nucleotides in length. 

34. The method of claim 28, wherein the gene is selected from the group 
consisting of CCR5 and CXCR4. 
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FIGURE 1A 
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FIGURE 2 
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FIGURES 
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FIGURE 6 
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FIGURE 7 
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FIGURE 8 
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FIGURE 9 
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FIGURE 10 
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FIGURE 11 
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FIGURE 13 
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FIGURE 14 
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SEQUENCE LISTING 

<110> CALIFORNIA INSTITUTE OF CALIFORNIA 
QIN, XIAO- FENG 
LO IS- CAB ALLE , CARLOS 
BALTIMORE, DAVID 

<120> METHOD FOR EXPRESSION OF SMALL ANTIVIRAL 
RNA MOLECULES WITHIN A CELL 



<130> CALTE.011VPC 

<150> 60/322,031 
<151> 2001-09-13 

<150> 60/347,782 

<151> 2002-01-09 

<150> Not yet available (Attorney Docket No.: CALTE.011PR) 

<151> 2002-08-27 

<160> 7 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This represents an ant i -human specific siRNA 

cassette comprising human sequence and synthetic 
linker, loop and terminator sequences. 

<400> 1 

accgagcatg actgacatct acttcaagag agtagatgtc agtcatgctc tttttc 56 

<210> 2 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> This represents an anti-human immune deficiency 
virus specific siRNA cassette comprising human 
immune deficiency virus sequence and synthetic 
linker, loop and terminator sequences. 

<400> 2 

gatccccgaa ggtggagaga gagacattca agagatgtct ctctctccac cttctttttc 60 
<210> 3 
<211> 9941 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> This sequence represents a lentiviral vector 

comprising a human immunodeficiency virus flap 
sequence, a green fluorescent protein variant 
sequence, a human ubiquitin promoter sequence and 
a woodchuck hepatitis regulator element sequence. 

<400> 3 

gtcgacggat cgggagatct cccgatcccc tatggtgcac tctcagtaca atctgctctg 60 

atgccgcata gttaagccag tatctgctcc ctgcttgtgt gttggaggtc gctgagtagt 120 

gcgcgagcaa aatttaagct acaacaaggc aaggcttgac cgacaattgc atgaagaatc 180 

tgcttagggt taggcgtttt gcgctgcttc gcgatgtacg ggccagatat acgcgttgac 240 

attgattatt gactagttat taatagtaat caattacggg gtcattagtt catagcccat 300 

atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga ccgcccaacg 360 

acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca atagggactt 42 0 

tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca gtacatcaag 480 

tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg cccgcctggc 54 0 

attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc tacgtattag 600 

tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt ggatagcggt 660 

ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt ttgttttggc 720 

accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg 780 

gcggtaggcg tgtacggtgg gaggtctata taagcagcgc gttttgcctg tactgggtct 84 0 

ctctggttag accagatctg agcctgggag ctctctggct aactagggaa cccactgctt 900 

aagcctcaat aaagcttgcc ttgagtgctt caagtagtgt gtgcccgtct gttgtgtgac 960 

tctggtaact agagatccct cagacccttt tagtcagtgt ggaaaatctc tagcagtggc 1020 

gcccgaacag ggacttgaaa gcgaaaggga aaccagagga gctctctcga cgcaggactc 1080 

ggcttgctga agcgcgcacg gcaagaggcg aggggcggcg actggtgagt acgccaaaaa 1140 

ttttgactag cggaggctag aaggagagag atgggtgcga gagcgtcagt attaagcggg 1200 

ggagaattag atcgcgatgg gaaaaaattc ggttaaggcc agggggaaag aaaaaatata 1260 

aattaaaaca tatagtatgg gcaagcaggg agctagaacg attcgcagtt aatcctggcc 1320 

tgttagaaac atcagaaggc tgtagacaaa tactgggaca gctacaacca tcccttcaga 1380 

caggatcaga agaacttaga tcattatata atacagtagc aaccctctat tgtgtgcatc 1440 

aaaggataga gataaaagac accaaggaag ctttagacaa gatagaggaa gagcaaaaca 1500 

aaagtaagac caccgcacag caagcggccg ctgatcttca gacctggagg aggagatatg 1560 

agggacaatt ggagaagtga attatataaa tataaagtag taaaaattga accattagga 1620 

gtagcaccca ccaaggcaaa gagaagagtg gtgcagagag aaaaaagagc agtgggaata 1680 

ggagctttgt tccttgggtt cttgggagca gcaggaagca ctatgggcgc agcgtcaatg 174 0 

acgctgacgg tacaggccag acaattattg tctggtatag tgcagcagca gaacaatttg 1800 

ctgagggcta ttgaggcgca acagcatctg ttgcaactca cagtctgggg catcaagcag 1860 

ctccaggcaa gaatcctggc tgtggaaaga tacctaaagg atcaacagct cctggggatt 1920 

tggggttgct ctggaaaact catttgcacc actgctgtgc cttggaatgc tagttggagt 1980 

aataaatctc tggaacagat ttggaatcac acgacctgga tggagtggga cagagaaatt 2040 

aacaattaca caagcttaat acactcctta attgaagaat cgcaaaacca gcaagaaaag 2100 

aatgaacaag aattattgga attagataaa tgggcaagtt tgtggaattg gtttaacata 2160 

acaaattggc tgtggtatat aaaattattc ataatgatag taggaggctt ggtaggttta 2220 

agaatagttt ttgctgtact ttctatagtg aatagagtta ggcagggata ttcaccatta 2280 

tcgtttcaga cccacctccc aaccccgagg ggacccgaca ggcccgaagg aatagaagaa 234 0 

gaaggtggag agagagacag agacagatcc attcgattag tgaacggatc ggcactgcgt 2400 

gcgccaattc tgcagacaaa tggcagtatt catccacaat tttaaaagaa aaggggggat 2460 

tggggggtac agtgcagggg aaagaatagt agacataata gcaacagaca tacaaactaa 252 0 

agaattacaa aaacaaatta caaaaattca aaattttcgg gtttattaca gggacagcag 2580 

agatccagtt tggttaatta agggtgcagc ggcctccgcg ccgggttttg gcgcctcccg 2640 

cgggcgcccc cctcctcacg gcgagcgctg ccacgtcaga cgaagggcgc aggagcgttc 2700 

ctgatccttc cgcccggacg ctcaggacag cggcccgctg ctcataagac tcggccttag 2760 

aaccccagta tcagcagaag gacattttag gacgggactt gggtgactct agggcactgg 2820 

ttttctttcc agagagcgga acaggcgagg aaaagtagtc ccttctcggc gattctgcgg 2880 
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agggatctcc gtggggcggt gaacgccgat gattatataa ggacgcgccg ggtgtggcac 2940 

agctagttcc gtcgcagccg ggatttgggt cgcggttctt gtttgtggat cgctgtgatc 3000 

gtcacttggt gagttgcggg ctgctgggct ggccggggct ttcgtggccg ccgggccgct 3060 

cggtgggacg gaagcgtgtg gagagaccgc caagggctgt agtctgggtc cgcgagcaag 3120 

gttgccctga actgggggtt ggggggagcg cacaaaatgg cggctgttcc cgagtcttga 3180 

atggaagacg cttgtaaggc gggctgtgag gtcgttgaaa caaggtgggg ggcatggtgg 3240 

gcggcaagaa cccaaggtct tgaggccttc gctaatgcgg gaaagctctt attcgggtga 3300 

gatgggctgg ggcaccatct ggggaccctg acgtgaagtt tgtcactgac tggagaactc 3360 

gggtttgtcg tctggttgcg ggggcggcag ttatgcggtg ccgttgggca gtgcacccgt 3420 

acctttggga gcgcgcgcct cgtcgtgtcg tgacgtcacc cgttctgttg gcttataatg 34 80 

cagggtgggg ccacctgccg gtaggtgtgc ggtaggcttt tctccgtcgc aggacgcagg 3540 

gttcgggcct agggtaggct ctcctgaatc gacaggcgcc ggacctctgg tgaggggagg 3600 

gataagtgag gcgtcagttt ctttggtcgg ttttatgtac ctatcttctt aagtagctga 3660 

agctccggtt ttgaactatg cgctcggggt tggcgagtgt gttttgtgaa gttttttagg 3720 

caccttttga aatgtaatca tttgggtcaa tatgtaattt tcagtgttag actagtaaag 3780 

cttctgcagg tcgactctag aaaattgtcc gctaaattct ggccgttttt ggcttttttg 3840 

ttagacagga tccccgggta ccggtcgcca ccatggtgag caagggcgag gagctgttca 3900 

ccggggtggt gcccatcctg gtcgagctgg acggcgacgt aaacggccac aagttcagcg 3960 

tgtccggcga gggcgagggc gatgccacct acggcaagct gaccctgaag ttcatctgca 4020 

ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac caccctgacc tacggcgtgc 4080 

agtgcttcag ccgctacccc gaccacatga agcagcacga cttcttcaag tccgccatgc 4140 

ccgaaggcta cgtccaggag cgcaccatct tcttcaagga cgacggcaac tacaagaccc 4200 

gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg catcgagctg aagggcatcg 4260 

acttcaagga ggacggcaac atcctggggc acaagctgga gtacaactac aacagccaca 4320 

acgtctatat catggccgac aagcagaaga acggcatcaa ggtgaacttc aagatccgcc 4380 

acaacatcga ggacggcagc gtgcagctcg ccgaccacta ccagcagaac acccccatcg 4440 

gcgacggccc cgtgctgctg cccgacaacc actacctgag cacccagtcc gccctgagca 4500 

aagaccccaa cgagaagcgc gatcacatgg tcctgctgga gttcgtgacc gccgccggga 4560 

tcactctcgg catggacgag ctgtacaagt aaagcggccg cgactctaga attcgatatc 4620 

aagcttatcg ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt 4680 

aactatgttg ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct 4740 

attgcttccc gtatggcttt cattttctcc tccttgtata aatcctggtt gctgtctctt 4800 

tatgaggagt tgtggcccgt tgtcaggcaa cgtggcgtgg tgtgcactgt gtttgctgac 4 860 

gcaaccccca ctggttgggg cattgccacc acctgtcagc tcctttccgg gactttcgct 4920 

ttccccctcc ctattgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca 4 980 

ggggctcggc tgttgggcac tgacaattcc gtggtgttgt cggggaaatc atcgtccttt 5040 

ccttggctgc tcgcctgtgt tgccacctgg attctgcgcg ggacgtcctt ctgctacgtc 5100 

ccttcggccc tcaatccagc ggaccttcct tcccgcggcc tgctgccggc tctgcggcct 5160 

cttccgcgtc ttcgccttcg ccctcagacg agtcggatct ccctttgggc cgcctccccg 5220 

catcgatacc gtcgacctcg agacctagaa aaacatggag caatcacaag tagcaataca 5280 

gcagctacca atgctgattg tgcctggcta gaagcacaag aggaggagga ggtgggtttt 5340 

ccagtcacac ctcaggtacc tttaagacca atgacttaca aggcagctgt agatcttagc 5400 

cactttttaa aagaaaaggg gggactggaa gggctaattc actcccaacg aagacaagat 5460 

atccttgatc tgtggatcta ccacacacaa ggctacttcc ctgattggca gaactacaca 5520 

ccagggccag ggatcagata tccactgacc tttggatggt gctacaagct agtaccagtt 5580 

gagcaagaga aggtagaaga agccaatgaa ggagagaaca cccgcttgtt acaccctgtg 5640 

agcctgcatg ggatggatga cccggagaga gaagtattag agtggaggtt tgacagccgc 5700 

ctagcatttc atcacatggc ccgagagctg catccggact gtactgggtc tctctggtta 5760 

gaccagatct gagcctggga gctctctggc taactaggga acccactgct taagcctcaa 5820 

taaagcttgc cttgagtgct tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac 5880 

tagagatccc tcagaccctt ttagtcagtg tggaaaatct ctagcagggc ccgtttaaac 5940 

ccgctgatca gcctcgactg tgccttctag ttgccagcca tctgttgttt gcccctcccc 6000 

cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga 6060 

aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga 6120 

cagcaagggg gaggattggg aagacaatag caggcatgct ggggatgcgg tgggctctat 6180 

ggcttctgag gcggaaagaa ccagctgggg ctctaggggg tatccccacg cgccctgtag 6240 

cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag 6300 
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cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt 6360 

tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca 6420 

cctcgacccc aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata 6480 

gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca 6540 

aactggaaca acactcaacc ctatctcggt ctattctttt gatttataag ggattttgcc 6600 

gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattaatt 6660 

ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagc aggcagaagt 6720 

atgcaaagca tgcatctcaa ttagtcagca accaggtgtg gaaagtcccc aggctcccca 6780 

gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccatagt cccgccccta 6840 

actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 6900 

ctaatttttt ttatttatgc agaggccgag gccgcctctg cctctgagct attccagaag 6960 

tagtgaggag gcttttttgg aggcctaggc ttttgcaaaa agctcccggg agcttgtata 7020 

tccattttcg gatctgatca gcacgtgttg acaattaatc atcggcatag tatatcggca 7080 

tagtataata cgacaaggtg aggaactaaa ccatggccaa gttgaccagt gccgttccgg 7140 

tgctcaccgc gcgcgacgtc gccggagcgg tcgagttctg gaccgaccgg ctcgggttct 72 00 

cccgggactt cgtggaggac gacttcgccg gtgtggtccg ggacgacgtg accctgttca 7260 

tcagcgcggt ccaggaccag gtggtgccgg acaacaccct ggcctgggtg tgggtgcgcg 7320 

gcctggacga gctgtacgcc gagtggtcgg aggtcgtgtc cacgaacttc cgggacgcct 7380 

ccgggccggc catgaccgag atcggcgagc agccgtgggg gcgggagttc gccctgcgcg 7440 

acccggccgg caactgcgtg cacttcgtgg ccgaggagca ggactgacac gtgctacgag 7500 

atttcgattc caccgccgcc ttctatgaaa ggttgggctt cggaatcgtt ttccgggacg 7560 

ccggctggat gatcctccag cgcggggatc tcatgctgga gttcttcgcc caccccaact 7620 

tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat ttcacaaata 7680 

aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat gtatcttatc 7740 

atgtctgtat accgtcgacc tctagctaga gcttggcgta atcatggtca tagctgtttc 7800 

ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt 7860 

gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc 7920 

ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg 7980 

ggagaggcgg tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct 8040 

cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 8100 

cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 8160 

accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 8220 

acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 8280 

cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 8340 

acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 84 00 

atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 8460 

agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 8520 

acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 8580 

gtgctacaga gttcttgaag tggtggccta actacggcta cactagaaga acagtatttg 8640 

gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 8700 

gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 8760 

gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gc'tcagtgga 8820 

acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga 8880 

tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 8940 

ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 9000 

catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat 9060 

ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag 9120 

caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct 9180 

ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 9240 

tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg 9300 

cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca 9360 

aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt 9420 

tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat 9480 

gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 9540 

cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa 9600 

aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt 9660 

tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt 9720 
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tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa 

gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt 

atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa 

taggggttcc gcgcacattt ccccgaaaag tgccacctga c 

<210> 4 

<211> 334 

<212> DNA 

<213> Homo sapiens 



9780 
9840 
9900 
9941 



<400> 4 

ccccagtgga 

gcgcgcgcca 

atacgataca 

ttagtacaaa 

ttatgtttta 

gctttatata 



aagacgcgca 
aggtcgggca 
aggctgttag 
atacgtgacg 
aaatggacta 
tcttgtggaa 



ggcaaaacgc 
ggaagagggc 
agagataatt 
tagaaagtaa 
tcatatgctt 
aggacgaaac 



accacgtgac 
ctatttccca 
agaattaatt 
taatttcttg 
accgtaactt 
accg 



ggagcgtgac cgcgcgccga 60 
tgattccttc atatttgcat 120 
tgactgtaaa cacaaagata 180 
ggtagtttgc agttttaaaa 240 
gaaagtattt cgatttcttg 300 

334 



<210> 5 

<211> 246 

<212> DNA 

<213> Homo sapiens 



<400> 5 

tctagaccat ggaattcgaa cgctgacgtc 
gtgtcactag gcgggaacac ccagcgcgcg 
acaggggagt ggcgccctgc aatatttgca 
aacgtgaaat gtctttggat ttgggaatct 
aagctt 

<210> 6 
<211> 355 
<212> DNA 

<213> Artificial Sequence 



atcaacccgc tccaaggaat cgcgggccca 60 
tgcgccctgg caggaagatg gctgtgaggg 120 
tgtcgctatg tgttctggga aatcaccata 180 
tataagttct gtatgagacc acggatccaa 24 0 

246 



<220> 

<223> This represents a mutant human sequence having an 
introduced bacterial tetOl binding site. 



<400> 6 

gggaattccc ccagtggaaa gacgcgcagg caaaacgcac cacgtgacgg agcgtgaccg 60 

cgcgccgagc ccaaggtcgg gcaggaagag ggcctatttc ccatgattcc ttcatatttg 120 

catatacgat acaaggctgt tagagagata attagaatta atttgactgt aaacacaaag 180 

atattagtac aaaatacgtg acgtagaaag taataatttc ttgggtagtt tgcagtttta 24 0 

aaattatgtt ttaaaatgga ctatcatatg cttaccgtaa cttgaaagta ctctatcatt 300 

gatagagtta tatatcttgt ggaaaggacg aaacaccgtg gtcttcaagc ttccg 355 

<210> 7 
<211> 634 
<212> DNA 
<213> E. coli 



<400> 7 

gctagccacc atgtccagat tagataaaag 
taatgaggtc ggaatcgaag gtttaacaac 
gcagcctaca ttgtattggc atgtaaaaaa 
tgagatgtta gataggcacc atactcactt 
ttttttacgt aataacgcta aaagttttag 



taaagtgatt aacagcgcat tagagctgct 60 
ccgtaaactc gcccagaagc taggtgtaga 120 
taagcgggct ttgctcgacg ccttagccat 180 
ttgcccttta gaaggggaaa gctggcaaga 24 0 
atgtgcttta ctaagtcatc gcgatggagc 300 
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aaaagtacat ttaggtacac ggcctacaga 

agccttttta tgccaacaag gtttttcact 

ggggcatttt actttaggtt gcgtattgga 

aagggaaaca cctactactg atagtatgcc 

tgatcaccaa ggtgcagagc cagccttctt 

agaaaaacaa cttaaatgtg aaagtgggtc 



aaaacagtat gaaactctcg aaaatcaatt 360 
agagaatgca ttatatgcac tcagcgctgt 420 
agatcaagag catcaagtcg ctaaagaaga 4 80 
gccattatta cgacaagcta tcgaattatt 540 
attcggcctt gaattgatca tatgcggatt 600 
ttaa 634 
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